45 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
<Not Specified>
Size:
15.3 GByte Production Status:
Newly created-in progress
Use:
Language Modelling
-
Paper title:MirasText: An Automatically Generated Text Corpus for Persian
-
Paper track:Infrastructural Issues/Large Projects
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Behnam Sabeti | Sharif University of Technology | IR |
| Author 2 | Hossein Abedi Firouzjaee | Amirkabir university of technology | IR |
| Author 3 | Ali Janalizadeh Choobbasti | Miras Technologies International | IR |
| Author 4 | Seyed hani elamahdi Mortazavi Najafabadi | Researcher | IR |
| Author 5 | Amir Vaheb | Miras Technologies Company | IR |
| Main Contact | Behnam Sabeti | Sharif University of Technology | None |
Documentation:
<Not Specified>
Written
Treebank,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike 4.0 International license
Size:
152871 words Production Status:
Newly created-in progress
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies for Persian
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Mojgan Seraji | Uppsala University, Department of Linguistics and Philology | SE | ||
| Author 2 | Filip Ginter | University of Turku, Department of Information Technology | FI | ||
| Author 3 | Joakim Nivre | Uppsala University, Department of Linguistics and Philology | SE | Uppsala University | None |
| Main Contact | Mojgan Seraji | Uppsala University, Department of Linguistics and Philology | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
From Owner
License:
<Not Specified>
Size:
10000 <Not Specified>Production Status:
Newly created-in progress
Use:
<Not Specified>
-
Paper title:Corpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
-
Paper track:Terminology
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Somayeh Bagherbeygi | <Not Specified> | None | ||
| Author 2 | Mehrnoush Shamsfard | Shahid Beheshti University | None | ||
| Main Contact | Mehrnoush Shamsfard | Faculty of Electrical and Computer Engineering, Shahid Beheshti University | IR | Shahid Beheshti University | IR |
Documentation:
<Not Specified>
Written
Tagger/Parser,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
Open source
Size:
10.6 <Not Specified>Production Status:
Newly created-finished
Use:
General
-
Paper title:A Basic Language Resource Kit for Persian
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Mojgan Seraji | Uppsala University | None | ||
| Author 2 | Beáta Megyesi | Uppsala University | None | ||
| Author 3 | Joakim Nivre | Uppsala University, Department of Linguistics and Philology | SE | Uppsala University | None |
| Main Contact | Mojgan Seraji | Uppsala University | SE |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
<Not Specified>
Size:
1.6 MByte Production Status:
Existing-updated
Use:
Named Entity Recognition
-
Paper title:BiLSTM-CRF for Persian Named-Entity Recognition ArmanPersoNERCorpus: the First Entity-Annotated Persian Dataset
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Hanieh Poostchi | University of Technology Sydney | AU | University of Technology Sydney | N/A |
| Author 2 | Ehsan Zare Borzeshi | Capital Markets CRC | AU | Capital Markets Cooperative Research Centre | N/A |
| Author 3 | Massimo Piccardi | University of Technology Sydney | AU | ||
| Main Contact | Hanieh Poostchi | University of Technology Sydney UTS | None |
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
LGPL-LR
Size:
1000 entries Production Status:
Existing-updated
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Extending the coverage of a MWE database for Persian CPs exploiting valency alternations
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Pollet Samvelian | Université Sorbonne nouvelle | FR |
| Author 2 | Pegah Faghiri | Université Sorbonne nouvelle | DE |
| Author 3 | Sarra El Ayari | Université Paris Diderot | FR |
| Main Contact | Pegah Faghiri | University of Cologne | None |
Documentation:
Yes. English
Written
Language Resources/Technologies Infrastructure,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
Freely Available
License:
Creative Commons
Size:
151671 tokens Production Status:
Existing-updated
Use:
Syntactic parsing
-
Paper title:A Persian Treebank with Stanford Typed Dependencies
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Mojgan Seraji | Uppsala University, Department of Linguistics and Philology | SE | ||
| Author 2 | Carina Jahani | Uppsala University, Department of Linguistics and Philology | SE | ||
| Author 3 | Beáta Megyesi | Uppsala University, Department of Linguistics and Philology | SE | ||
| Author 4 | Joakim Nivre | Uppsala University, Department of Linguistics and Philology | SE | Uppsala University | None |
| Main Contact | Mojgan Seraji | Uppsala University, Department of Linguistics and Philology | None |
Documentation:
The Uppsala Persian Dependency Treebank Annotation Guidelines
Written
Corpus,
Language Type:
Multilingual
Languages:
Iranian Persian
Availability:
From Owner
License:
The Islamic Sciences Computer Research Centre (NOOR)
Size:
29982 sentences Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Persian Proposition Bank
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Azadeh Mirzaei | Assistant Professor of Allameh Tabataba’i University | IR |
| Author 2 | Amirsaeid Moloodi | Assistant Professor of Shiraz University | IR |
| Main Contact | Azadeh Mirzaei | Assistant Professor of Allameh Tabataba’i University | None |
Documentation:
<Not Specified>
Speech
Corpus,
Language Type:
Multilingual
Languages:
English Iranian Persian
Availability:
Freely Available
License:
OpenSource
Size:
20 GByte Production Status:
Newly created-in progress
Use:
Person Identification
-
Paper title:MirasVoice: A bilingual (English-Persian) speech corpus
-
Paper track:Speech
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Amir Vaheb | Miras Technologies Company | IR |
| Author 2 | Ali Janalizadeh Choobbasti | Miras Technologies Company | IR |
| Author 3 | Mahdi Mortazavi | Miras Technologies Company | IR |
| Author 4 | Saeid Safavi | University of Hertfordshire | GB |
| Author 5 | Behnam Sabeti | Sharif University of Technology | IR |
| Main Contact | Amir Vaheb | Miras Technologies Company | None |
Documentation:
Yes, English and PersianLanguage Type:
Multilingual
Languages:
Iranian Persian Tajik
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> <Not Specified>Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Tajik-Farsi Persian Transliteration Using Statistical Machine Translation
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Chris Irwin Davis | University of Texas at Dallas | None | ||
| Main Contact | Chris Irwin Davis | University of Texas at Dallas | US | The University of Texas at Dallas | US |
Documentation:
<Not Specified>




